RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the fdllowing computer readable 
form: 

Application Serial Number 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21helD@.uspto.eov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(a),uspto.eov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 




Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July I, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. jf- , . . 



Checker Version 3.0 can he down loaded from the USPTO webs ite at the following address; 

http://www.uspto.gov/web/ofrices/pac/checker 



Raw Sequence Listing Error Summary 



RRROR DETECTED 

ATTN: NEW RULES CASES; 

I Wrapped Nuclcics 

Wrapped Aminos 



Invalid Line Length 

Misaligned Anuno 
Numbering 

Non-ASCII 



Variable Length 



Patentin 2.0 
"bug" 



7 Skipped Sequences 

(OLD RULES) 



(NEW RULES) 



_Usc of n*s or Xaa*s 
(NEW RULES) 




10 \/ Invalid <213> 
Response 



Uscof<220> 



12 



13 



Palcnlln 2.0 
' "bug- 



Misuse of n 



c,.nnp.rrFnrORRECTION SERIAL NUMBER: ^9/^^ ^ 

: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS. WHIc/wERE INSERTED BY PTO SOm 

The numberAext .t the end of each line "^mpped" down lo the nex. line. This «^~'ijy«|[ 
W.S retrieved in . word processor after creating it Ple«e adjust your nght margin to .3. this w.U 
prevent "wrapping." 

The rules require that a line not exceed 72 characters in length. Ihrs includes white spaces. 

The numbering under each 5* amino acid is misaligned. Do not use Ub codes between numbers; 
use apace characters, instead. . _ 

The submitted file was not Mved in ASCII(DOS) text, as r«,ulrtd by the Sequence KmU^ PI**** 
ensure your lubsequent submission b saved III ASCII t^xl. 

«:Muence/s^ contain n's or Xaa's representing more than bne rtsidue. Per Sequence Rule., 
each n o^XiTTi^ only represent a slnel. residue. Please present the maximum number of each 
'^ZZ'aZ ^.ble len Jh and indicate in the <220>-<223> section that some may be missmg. 

A "buE" in Patentin version 2.0 hal«u,ed (tit <22p>-<223> auction to be ntissing from ~ ■- 

.«,ulLerfsr Normally. Patentin wwld^tomatically g^aipate this section front Ihe 

S^XeJu^ amino acid seque'nce. This applies to the mandatory <220>-<223> secUons for 
ArimcUl or Unknown aequences. 

Seauence(s^ missing. If intentional, please insert the following Uiies fo^ each skipped sequence: 
mVoRMATIONFORSEQIDNO:X:(in^ . ... 

(2) CHARACTERISTICS: (Do not insert any subheadings under this hcadmg) 

(S SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust Uic "(") NUMBER OF SEQUENCES:" response to Include Uie skipped sequences. 



8 Skipped Sequences Sequcnce(s) 



missing. If Intentional, please insert the following lines for each skipped sequence. 



<2 10> sequence id number 
<400> sequence id number 
000 

Use ofn's and/or Xaa*s have been detected in the Sequence Listing. 

Per I 823 of Sequence Rules, use of <220>-<223> is MANDATORY ,f n's or Xaa^s are present. 

L <Sto <^^^^^^^ section, please explain location of n or Xaa. and which residue n or Xaa represents. 

P . ifi^i nf^enucnce Rules the only valid <213> responses are: Unknown, Artificial Sequence, or 
sdeJtffic leToenu^^^^^^^^^ <220>-<223> section is required when <2.3> response is Unknown or 
is Artificial Sequence 

cin„.n<-^.^ missine the <220> "Feature" and ass.ociated.numeric identifiers and responses. 

SofS>"T^^ U M^I>ATORY if <2 13> "Organism" response is "Artificial Sequence" or 
"llnlrnown" Please explain source ofgenetic material in <220> to <223> section. 
(SeSX.lS«e"Coi/1998, Vol. 63. No. 104. pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrvpted file. 
Sttng irmissmg mandatory numeric identifiers and responses (as indicated oji raw sequence 
IS Lead, please use^ile Manager" or any other m«,ual means to copy file to fioppy disk. 

n can only be used to represent a single nucleotld* in a nucleic acid sequence. N is not used to represent 
any value not specifically a nucleotide. 

AMaMH -Biotechnology Systems Branch -08/2 1/200 1 



Th. Wes of errors shown exist Ih^ushou. the Sequene; Listing, ilease checlc 
subsequent sequences for similar errors. 
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OIPE 



RAW SEQUENCE LISTING DATE: 08/15/2001 

PATENT APPLICATION: US/09/700,158 TIME: 07:36:46 

Input Set : A:\Seq.txt 

Output Set: N:\CRF3\08132001\l700158.raw 

Does Not Comply 

3 <iio> APPLICANT: BowNEss, RACHEL Oorrected Diskette Ne8d©d 

4 MCMICHAEL, PAUL t^JO^eue iMeea^Q 

5 JAMES, ANDREW Ci^^^MV^ I ^ 
7 <120> TITLE OF INVENTION: NOVEL MOLECULE AND DIAGNOSTIC METHOD ^ 

9 <130> FILE REFERENCE: P02077US0/10025595 

11 <140> CURRENT APPLICATION NUMBER: 09/700,158 

12 <141> CURRENT FILING DATE: 2000-11-10 

14 <150> PRIOR APPLICATION NUMBER: PCT/GB99014 81 

15 <151> PRIOR FILING DATE: 1999-05-11 
17 <160> NUMBER OF SEQ ID NOS : 9 

19 <170> SOFTWARE: Patent In version 3.0 

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 9 

23 <212> TYPE: PRT 

24 <213> ORGANISM: INFLUENZA 
26 <400> SEQUENCE: 1 

28 Ser Arg Tyr Trp Ala lie Arg Thr Arg 

29 1 5 

31 <210> SEQ ID NO: 2 

32 <211> LENGTH: 10 

33 <212> TYPE: PRT 

34 <213> ORGANISM: HIV 
36 <400> SEQUENCE: 2 

38 Lys Arg Trp lie lie Met Gly Leu Asn Lys 

39 1 - 5 10 

41 <210> SEQ ID NO: 3 

42 <211> LENGTH: 17 

43 <212> TYPE: PRT X J ^ aaAoS< rt^p(^^^^ 
C— > 44 <213> ORGANISM: ARTIFICIAL PEPTIDE ^ JC^rOrc^ » I |M 0^^^^\ » 

4 6 <220> FEATURE: - Prr\^t^CAcJl St^CkAMA^ 

47 <221> NAME/KEY: misc feature . ^ 



48 <222> LOCATION: (1)..(17) 

4 9 <223> OTHER INFORMATION: Biotinyiation Sequence' 
52 <400> SEQUENCE: 3 

54 Giy Ser Leu His His lie Leu Asp Ala Gin Lys Met Val Trp Asn His 

55 1 5 10 15 
57 Arg 

59 <210> SEQ ID NO: 4 

60 <211> LENGTH: 9 

61 <212> TYPE: PRT 

62 <213> ORGANISM: CMV (HLA-A2 CMV) 
64 <400> SEQUENCE: 4 

66 Asn Leu Val Pro Met Val Ala Thr Val 

67 1 5 

69 <210> SEQ ID NO: 5 

70 <211> LENGTH: 26 

71 <212> TYPE: DNA 

The types of errors shown exist throughout the Sequence Listing. Please check \ 
< subsequent sequences for similar errors, 

file://C;\Crf3\Outhold\VsrI700 1 5 8.htm 8/1 5/0 1 
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RAW SEQUENCE LISTING DATE: 08/15/2001 

PATENT APPLICATION: US/09/700,158 TIME: 07:36:46 

Input Set : A:\Seq.txt 

Output Set: N:\CRF3\08132001\I700158.raw 



1/ 

26 



(1) . . (26) 



) NO: 6 
1: 26 
DNA 

CSM: ARTIFICIAL ty 



2> LOCATION: (1) . . (26) 
3> OTHER INFORMATION: DNA Prime 
0> SEQUENCE: 6 

gccttggcct tgctgatctg tgtctc 26 
10> SEQ ID NO: 7 



(1) . . (21) 



1: 67 

DNA /' 
CSM: ARTIFICIAL ^ 

IE: 

CEY: misc_f eature 
[ON: (1) . . (67) 
INFORMATION: DNA Primer 



I: 22 
PRT 



4-0 chAtu^i^ 



^ ^y. , 




60 
67 




<222> LOCATION: (1)..(22) 
<223> OTHER INFORMATION: Fusion protei 
<400> SEQUENCE: 9 

Pro Leu Thr Leu Arg Trp Glu Gly Ser Leu His His lie Leu Asp Ala 
15 10 15 



file://C:\CrO\Outhold\VsrI700 1 58 .htm 
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RAW SEQUENCE LISTING DATE: 08/15/2001 

PATENT APPLICATION: US/09/700,158 TIME: 07:36:46 

Input Set : A:\Seq.txt 

Output Set: N:\CRF3\08132001\I700158.raw 

152 Gin Lys Met Val Trp Asn 

153 20 
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VERIFICATION SUMMARY DATE: 08/15/2001 

PATENT APPLICATION: US/09/700,158 TIME: 07:36:47 

Input Set : A:\Seq.txt 

Output Set: N:\CRF3\08132001\I700158.raw 

L:44 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:3 
L:72 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:5 
L:88 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:6 
L:104 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:7 
L:120 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:8,^^ 
L:138 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:9 
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